A selectivity model for fragmented relations: Evaluated for different standard data distributions
نویسندگان
چکیده
In the estimation of selectivity, many models assume that data is uniformly distributed, which is not true for many applications. In this paper, we discuss a generalized selectivity model, the so-called lαβ-model which is independent of the data distribution. The model predicts the fraction of a relation that should be selected in order to process a query. We have evaluated this model for different data distributions in order to determine the accuracy of this model. Data distributions that have been considered are the uniform distribution, the normal distribution, the exponential distribution, Pearson’s distribution, and Zipf’s distribution. From our experiments, it appears that the lαβ-model predicts the selectivity well, especially for the skewed distributions. Applying the lαβ-model on different fragment sizes of a relation yields quite acceptable selectivity values as well.
منابع مشابه
The Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data
The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...
متن کاملA selectivity model for fragmented relations in information retrieval
New application domains cause todays database sizes to grow rapidly, posing great demands on technology. Data fragmentation facilitates techniques (like distribution, parallelization, and main-memory computing) meeting these demands. Also, fragmentation might help improving efficient processing of query types such as top N. Database design and query optimization require a good notion of the cos...
متن کاملImplementation of Hyperbolic Tangent Function to Estimate Size Distribution of Rock Fragmentation by Blasting in Open Pit Mines
Rock fragmentation is one of the desired results of rock blasting. So, controlling and predicting it, has direct effects on operational costs of mining. There are different ways that could be used to predict the size distribution of fragmented rocks. Mathematical relations have been widely used in these predictions. From among three proposed mathematical relations, one was selected in this stud...
متن کاملEvaluation and Application of the Gaussian-Log Gaussian Spatial Model for Robust Bayesian Prediction of Tehran Air Pollution Data
Air pollution is one of the major problems of Tehran metropolis. Regarding the fact that Tehran is surrounded by Alborz Mountains from three sides, the pollution due to the cars traffic and other polluting means causes the pollutants to be trapped in the city and have no exit without appropriate wind guff. Carbon monoxide (CO) is one of the most important sources of pollution in Tehran air. The...
متن کاملRecurrence Relations for Moment Generating Functions of Generalized Order Statistics Based on Doubly Truncated Class of Distributions
In this paper, we derived recurrence relations for joint moment generating functions of nonadjacent generalized order statistics (GOS) of random samples drawn from doubly truncated class of continuous distributions. Recurrence relations for joint moments of nonadjacent GOS (ordinary order statistics (OOS) and k-upper records (k-RVs) as special cases) are obtained. Single and product moment gene...
متن کامل